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ABSTRACT 

The a.evelopment of the Enalish as a Second L>.nguiqe 
Assessment Battery (ESLAB) was ba sed ^on needs indicated by 
^chievfenant arid placement statistics/ The selection of the' 
proficiency skills flisteninq, speaking, readina, and writina) 'wis 
supported by changing views in language assessment and by 
legislation. Andrew Cohen's language model<was adapted as a 
theoretical frame of reference. The battery was develops with a 
xecognition of the interrelationship of ' native, and second language 
skills. Receptive and expressive comt>onents of ESLAB included the 
oral screening rest. Oral Competency Test, Aural Comprehfeiision Tast, 
Dictation Exercise, Structural Competency Test, and Informal Reading 
Inventory. To ascertain ESLAB 's stati stical stability , ^59 .Hispanic 
mner-city bilingual students in arades 7 and 8 were tested. Each of 
the tests was subjected to item analysis, involving item difficulty 
ani^itea discriiination . Face, con|ent, predictive, and concurrent 
v^alLlity were established. Internal consistency and interrater 
correlations for the expressive ^?ubtasts wera also attained, Ov^-all, 
both tha receptive and expressive subtests of the ESLAB pilot edition 
proved to be valid and reliable measures of language proficiency, 
(Author/GDC)' . . ' , 
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Rationale and Theoretical Framework 

The English as a Second Language* Assessment Battery /ESLAB) was de^ 
veloped to meet the need for a valid and reliable criterion-referenced 
instrument to assess the English language proficiency skills of second^ 
ary bilingual students. A l^ilingual student is defined as one who lives 
in a two language envirormentv regardless of how well he/she speaks the 
languages (Zintz, 19X5). The Instrument is designed tcTassess both 
receptive (Listening/reading) and expressive (spek'klng/ writing) language 
skills* It was intended to fc)l used to place student's according to pre-- 
established entry level cat^gbries ia appropriate levels for meaningful 
instruction in English as k Second Language (ESL) and reading- 



Need .for Language Proficiency Assessment 



Several interrelated issues contributed to raising the current con- 
cerns for language proficiency assessment. Among them were achievement , 
4he haphazard placement of minority students in educational programs, 
* legislative developments , and the personal experience working ^^ith 
bilingual students and teachers. In esjLablighing the frame of ^^ference 
for the ESLAB study, each aspect was considered. ^ 



\ 




Achievement* The educational. achievement gt the linguistic m.inority 
student was found to be "consistently below t^he achieiJje'ment of the total 
national age population" (National Assessment,*^ 1977/ p* 5). Among the 
contributing factors to this sit^uation wfere and cop'£inue to be language, 
socibcultural differences, amount of schooling, a^s well as societal atti-* 
tudes toward the non-native Eng*lish speaKfer* XKis educational situation ^ 
went virtually unacknovl edged until a ^overnmjght analysis of the school 
achievement of Mexican-Amer lean students in/£he Southwest demonstrated 
that 8*1 was the average yea^s of schooli^^ for students 14 years of agfe 
and oldeitr (U.S* Comm.^ '^Unfinished Education,'* 1972). Parallel school 
achievement problems ha^fe been reporte^^ for other Hispanic, Asian, and 
Native American groups XU,S. Coimn., "Puerto Ricans,'^ 1976; Lau .vs* Nichols^ 
1974; U.S. Comm. ''Social Indicators/,** 1978), Statistics on the school 
drop-out or **push-out** rate of st^i^ents from non-English" speaking back- 
grounds also underscored the int^pslty of the need fot re--a^sessment of 
the educational situation 'C'The^*fay W,e Go To School,'' 1970; U.S. Comm., 
;^^Exc^uded Student, 1972; Steipter, 1974). ^ ' 
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Placement . Inappropriate placement as a result of indiscriminate 
English assessment p-roceduresr involving intelligence cests also drew at- 
tention to the educational needs of the linguistic minority student. A 
.classic example'of this practice was documented in Mercer.'? (1971) 
Riverside, California, stqdy of the: labeling process. 

Ii> the PsYcbological Testing^ of ^American Minorities (1975), Samuda 
explored theTafnif ications of rel)^ing on intelligence and achievement 
test results in the case 'of minority studefit-s. He analyzed the testing 
controversy and pointed out ^hat although tests have not proven to.be a 
pa'nacea, they will most probably not be abolished. 

\^ 

Fi^shman et . al. in the Guidelines for Testing Minority ISroup Chil -. 
D964) ident-ififed three major issues when using standardized tests 
with minority students: 

1\ Test results may not reliably differentiate am ang minority 
group scores; 

2. Test results may not be predictive for the minority group / 
student as they may be for the middle^class student; / 

5. Test results should be interpreted* by a professional who 
is^aware of the language and sociocultural background of*^ 
the .stuclents. Samuda summarized Tislynan ^ s* cone er^ in/ 
stating thaf 'good test usage depends upon conscient ious ^ 
, methodological, and critical examination of test s'c0Tes'' 
Cp. 17). * ' ^ ^ ' / / 
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De Avila and ,toassy C1974) stress that low level p^spf ormance 'may 
only be a reflecti^ of minimal knowledge of the languXg^', non-mastery 
of English reading skilly ^nd/or unf amiliarity with t^e/ culture of the 
test rather than an index of actual intelligence, 5'6x these reasons, 
language, and cultural background should be considered before Using stand- 
ardized intelligence and achievement measures as indices of potential. 
The level of English language proficiency and the defree -of familiarity 
with testing ^ocedures should be taken into account before using any 
type of standardized English achievement test whicb may Be used for 
placement purpo ses^^. ^ ' \ 



Legislative Developments . The official recognition of language 
isinorities occurred in 1968 when Congress passed, the Bilingual Educa- 
tion Act - Title VII as an amendment to the 1965 Elementary and Secondary ^ 
Education Act. This landmark legislation stipulated that a program of 
instruction be designed to ''taach children in English and. to teach in ^ * 
{the 'na't ive) language so they can pybgress effectively through school" 
(Bilingual Education: An' Unmet Need," 1976, ^. 11) • In this definition, 
it is obvious' that language was a unique coin'ponent * to be oonsider^ed* 

Language wa.^also found to be important in the education of 
limited English-speaking students in the/1974 San Francisco court ^ 
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case, of- Lau vs. Nichols/ In complying \ith the court's opinion, the San^ 
Francisco Unified School^ District with a citizen's task force designed 
guidelines for school districts to'follow in the case ofi, the students 
whose "home 'language is other than English." Some months later Congress 
codified the decision as part of the Equal Educational Opportunity Act 
c>^f 1974 (Teitelbaum and Hiller, 1977) and the Office of Civil 'RightsA|\ jA.. 
adopted guidelines which have 'pome ,to be known as the Lau Remedies (^Ifek 
Force*/' 1975),. They specify that student's through" language xisage question- 
tionnaires be identified as: . - . 

.A.. Monolingual speak*er. of the language other than English 

B. Predominantly speaks the language other than English 

-C. Bilingual 

D. Predominantly speaks Engl ish 

E. Mono-lingUal speaker of English ' 

Based on the general categoi^y in which a "student fall^, educational pro- 
grams are then designed - and" matched to student naeds . 

• While the Biling\*al Education Act revisions of 1974 .and the Lau 
Decision C1974), at the federal level, support the efforts of bilingual 
educators. Congress • continues to-press for results which will validate 
the government's efforts to support bilingual students in the educational 
process. Specifically, statistical dgrta that documents the effects of 
bilingual education is sought. For this reason, in the extensi^on of the 
Bilingual Education Act thtough the Education Amendments of'197^5, there . 
is an effort to clarify who is to be. serviced through ^bilingu'al (education. 
Students- who are eligible fqr bil ingtial education, according to the new . 
regulations, ate no longer define<J as bei^ig of .'^limited English-speaking 
ability" but rather to be of "limited English ^pi-oficiency" (Education 
.Amendments, 1978, p. 69). This alteration in def inition^refocuses the 
previous emphasis on proficjency apart from literacy. In other words, 
language proficiency according to the amendments includes all language 
•skills, i.e., listening, speaking, "reading, and writing. / , 

Experience with Bilingual Teacher sf Prior to legislative mandates, 
t'he experdfence .of working with bilingual students strongly demonstrated 
|tTie need to .organise an approach to language competency assessment • While 
' several gOYpranental ageii^iesi^^^p^^ Service Institute, FSI; Central . 

Intelligence Agency^ tIA; ari?'^the Civil Service Commission, CSC) have 
developed a mode^l for assessment of second language competence fox; a<JuUs 
-learning Jtoreign languages Clones and Spolsky, 1975), a uniform proceto^e 
has yet to be developed fo^^ educators attempting to assess language com-^ 
f^^l^^^'^ student 1^ in bilingual and/or other program? of instruction for^ 
language minor ltf^|)er sons. The reality in the past been that the in- 
dividual classrQpm teacher who attempted to diagnose language skills has 
done so on *an individual basis rather than as part of an organized m^th- 
odologicaj procedure. Additionally, as previously discussed/ the inter- 
related issue of identifying adequate as well as valid test instruments 
have complicated the process. For although numerous formal and informal 
instruments to assess language competence exist, they are seldom compre^ 
hensix^e or organically Integrated^ in design (Silverman, Noa, and Russell, 
1977; Gutierrez and Rosenbach>, 1975), Also, few have been found to be 



designed for secondary srudents- This has created problems for the 
classroom teacher desiring tor assess students' linguistic competencies. 
Not only must skills be catalogued^ but appropriate tests must be found. 
As a result, the assessment procedure employed by a majority of second-.' 
ary bilingual teachers has been tandom. It has lacked uniformity in the 
skills considered significant and in the instruments used to measure 
thm. Finally, ^the proce,^s has been found to ^ be time-consuming and re- 
dious for the classroom teacher who needed an expedient method for asses- 
sing the strengths and- weaknesses of bilingual students (Gonzales, 1979}. 



The Selection of Language Proficiency gkills 

To date, the assessment of language proficiency of bilingual stu- 
dents' has been difficult for two major reasons: 

9 

1. Inadequate identification* of specific language proficiency 
skills necessary to determine the bilingual student ^s abil- 
ity to perform in a monolingual or in a bilingual classroom 
CCummins, lp79) ; Gonzales C1979) confirms this perspective 
in Stat ing^ 'There is no general agreement' ajuong educators 
and/or linguists as to what constitutes eil^r the partic-t 

/ ular functionalities of language which impinge significantly 
on school learnings the structures which situat ionally may 
affect it^ or the specific skills which should be the minima 
given age or- griide" Cp* 13).-* * 

2. Lackji of . identification of prp\^n, valid, and reliable stand-^' 
ardized and/or cr iterion-ref erenced Jj^lts that measure lin- 
guistic competencies of secondary b^ilingu^i^ students in 
native languages and in English (Silvermalri/^Noa, Russell, . 
1977}/ / , ' 

In analyzing the skills of listenings speakings reading, and writings 
Lciban (1963), Horowitz and BerkowitE (1967), Chastain (1976), Wilkinson 
(1971), Wilkinson an4 Stratta (1970), Moscovici (1967), and Higglns (1978) 
indicated that there are interrelationship:^ among the four language skill 
areasl The results^ of the cited studies deinons trated that the receptive 
and expressive language areas i;equire mastery of separate and somewhat 
different skills, • , 

What was to be assessed was considered through a review of commonly 
used terms in language assessment: language dominance, language profi-- 
ciency, and commutiicat iye cqmpe^^^e. Language dominance was found 
generally to ref^r to oral compwKmce* Language proficiency^, on the/ 
other hand^ was found tTo refer to both oracy and literacy skills. 
Communicative competence was found to refer to^^^e social and cultural 
knoi^^ledge an individual is presumed to have enadilng him/her to Interpret 
^al/and/or linguistic forms in a specific situation. These distinctions 
were made in an effort to identify the areas of language that should be 
considered in the development of the . ESLAB . 

Other considerations includ(id the selection of a te^sting philosophy 
and test type; ^ Since the purpose of the ESLAB was to assess language 
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skills in the four skill areas, both discrete point and integrative phi-- . 
losophies were found to be relevant. Applying this dual approa9^ to lan- 
guage assessment, a -person ability to coiamunicate orally or writing 
in a particular situation, as well as the ability to manipulate the cm^ 
ponentg- of language in o^al and wTitt^en form, were taken into^ considera- 
tion. One other major decision iJi the development of the^SLAB was the 
one to adopt a crite:;ion-ref erenced format. Although a language profi- 
ciency instrument could be either norm- or criterion-ref er^inced, the pur- 
pose of the ESLAB seemed more in tune'^with a criterioT?Mr:ef^renced in- 
strumentat ivui. This decision influenced both the design and the develop-^ 
ment of the test. ^ . * /^^^ 

An adapted version of Cohen ^s 0975) model (Figixre 1) was presented 
as the theoretical frame of reference. This model was adppted because 
it demonstrated the interrelationships among the language areas. As . 
shown iji the model, oracy and literacy skill s^ are components of the re« 
ceptive and expressive language areas. The model exemplifies the inter- 
relationships among the language skills, the language components, - the 
language varieties, and the langiaage. domains. As illustrated, although 
the language components are essential for the communicative process, 
they should not be th6ug^lt of in isolation from l|he specific purpose of 
using langia'age at any one time^ The language cojflponents necessary for 
producing meaningful language include knowledge af the sound system 
(phonemes) for oral language and comprehension of the orthographical 
system (graphemes) for wri4:ten language. Language use requires fami- 
liarity ^^ith vocabulary -Qexicon) , inter'Yi^ ligation of .grammatical struc- 
tures or rules of language usage Csyntax}, and the ability to attac^i 
meaning to referents Csemantics). Most important is pragmatics or the 
ability to process sequences of linguistic components and relate them 
to th'e ;)roader context of experience (caller, I'^^O). The other aspects 
of language that are illustrated * in Figure 1 afe the language domains 
'or context within which language can be described and the language 
\*ariety or type of language that can be used in diverse geographical . 
locations or v?ithin the same speech communitv or commanities ^'irishman. 19'"2), 

It is seen that the language skills considered essential for com- 
munication are listening, speaking, readings and writing. In ^ssence^, 
the figure demonstrates the variety of potential interrelationships 
among the skill areas and the language components. 



Language Processing ^ 

The theoretical aspects of the language processing ski}l^s and their 
relationship. to language proficiency assessment were also described 
through a model. Figure 2. ^It was developed in order to illi/strate the 
major cognitive functions necessary to process language. Both, cognitive 
and affective processes were considered. The ability to receive and 
manipula^te language for communication purpose requires that an indiyi* 
dual have the ability to attend, perceive, and remember phonological, 
lexical, and syntactical aspects of '5^ceivod language (jjiput). Through 
the sticcesful process of synthesizing the input and normal motor control, 
expressive language Cotxtput) becomes possible (McLaughlin, 1978). While 
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Figure. 1 
Figure 2 
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Processing and 
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this analysis is somewhat simplified, for those conceri^ed with language 
proficiency, knowledge of the aspects involved in proces'^ing language is 
i.n>portant. This consideration becomes particularly necessary when a stu- 
dent being assessed for language proficiency seems to evidence problems 
related to processing rather, than to language itself. 

^ The affective forces of self-concept, motivation, and attitude also 
-mnuence to what extent individuals activate their potential to manip- 
ulate language as a tool for communication. These factors are further 
documented in second language <L23 acquisition studies (Gardner arid- 
Lambert, 1971; Ervin-Tripp, 1974; Taylor, 1974), ^here it is demonstrated 
that these internal forces greatly interact to influence the acquisition 
of ^competence in the native language CLl) as well as in a second language 
C12}. mth regard to the affective components, it is useful to keep in 
mind that self -concept ,• iiiot ivat ion, and attitude can be modified through 
internal and/or external changes. 



The ESLAB Pilot Test 

The ESLAB instrument was developed with a. recognition of the inter- 
relationship of native -and second language skills. Cummins Cl9'79) pro- 
vided understanding of this relationship through his theory of interd'e- 
pendence. He pointed out that "development of competence in a second 
language CL2) is partially a function of the type of competencfe already 
developed in LI,. when intensive exposure to L2 begins" (Cummins, 1979, 
p. 222}. Through .several studio it was found that the abstract control 
of language wa,s most important in facilitating second language acquisi- 
tion. Literacy skills in the native langauge were found to play a par- 
ticularly important role in arriving at a threshold level of language 
that Would allow the' potentially beneficial^ aspects of bilingxialisro to 
be exhibited. For this feason, although the 'ESLAB was designed to assess 
English language skills, it has been strongly recommended that its admi- 
nistration be accompanied by a native lanjgua'ge examination. 

The, completion of the ESLAB pilot test version took approximately 
two years. Th^: process- involved the development of oblectixes and .se- 
lection of a technique,, mo St appropriate to meet the stated" objectives. 
The content base was .^elected from curriculum being used in ESL and bi- 
lingual programs. Linguist i(^, and cultural backgrounds of the potential 
students to be .assessed were. also taken into account. 

♦ 

The materials developed for the pilot test version were the ESLAB 
Manual, Handbook, and^ Scoring Charts. Each component was briefly des- 
cribed. The ESLAB receptive components included the Aural Comprehension 
Test, the Structural Competency Test and the I nformal* Reading Inventory. 
The ESLAB expressive components included the Oral Screening Test, the 
Oral Competency Test, the Dictation Exercise, and the Writing Sample.. 



ESLAB 



Scores: Derivation and Interpretation . 



In scoring the ESLAB the test administrator first tabulates' student 
results. Once the student's performance is ranked accc<%ding to the 
individual scoring criteria,^ the total points are compared to the 
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criterion for establishing ESL and reading levels (Table 1)* Test ^ / 
components are all translated into levels except for th% 9^al 
Screening Test which is scored as pass/fail. Based on overall 
results, a language profile for each student is established for 
placing students into instructional groupings. 

\ The rationale supporting the need for a' hierarchically arranged 
entry-level criteria was based on^ curricular considerations, experu^^rce 
with bilingual students^ the research which suggests that there is" a 
threshold level of lafiguage competence which must be attainei in order 
to avoid cognitive deficits and to allow the potentially beneficial as- 
pects of bilingualism to affect cognitive functit)ning CCummins, 1977;^ v 
1979), and the research indicatirfg- that there is a hierarchy of lan- 
guage skills CChastain, 1976). 

The levels are defined in terms of the test authors* experience in 
worlcing with bilingual students. They are: Beginner I,' Beginner 11^ 
Intermediate I, Intermediate II, and Advanced. ^ Table 1 provides a ' sum- 
mary of the criterion for the entry level categ^ories. 

i Table 1 

Criterion for Establishing ESL and Reading Levels 



LeVel ESL Ct2) English Reading 



Beginner I 


Illiterate student.' in Ll 
may or may not have aural/ . 
oral L2 skills 








Beginner 11 


Literate student in Ll 
may or may 'not have aural/ 

oral L2 skills ^ 

> 






t 


Iiitei|[iedi^te I 


Literate student in \A has 
basic aural/oral English 
skills, limited L2 literacy 
skills 


Reads 


at 


A 

2nd gr. level 


Intermediate II 


Literate student in H hafs 
basic aural/oral L2 skills^ 
limited English literacy 
skills 


Reads 
level 


at 


3rd^5th ^r-* 


Advanced 


Literate student in Ll/ 
literate in English 


Reads 


on 


grade level 


§ 











The beginners levels are subdivide<| in order to accoiranodate two very 
differently equipped types of students'. | Beginner I is for students who 
may or may not be conversant in Engllsfiv The common denomiriat5>r for this 



eroup is the lack of literacy skills in either the native ILI) or m 
"^he second language (L2), Beginner II includes studentswho may or 
mav no"c possess aural/oral skills in -English, but vho are literate 
to' at least a second grade level in their native language. Inter- 
njediate rstudents are def^tied as those students who have aural/oral 
skills In English and who capable of reading English 'at 2nd grade 
level. InteY-mediate students possess aural/oral "English skills and 
are able to read in English at 3rd -to 5t.h grade level. .4dvanced stu- 
dents possess aural/or^l English- skills- and can read on grade level. 

- Description of tKe ESL4B Tests .* . • 

-~7 . Oral Screening (O.S.) Test . The Oral Screening Test is composed of 
-10 inforinatiohal questions. determining a studen|:'s performance, com- 
• .prehension and production are 'evaluated on a parallel rating scale by the 
test adiDinistrator. Administered individually, it is tape recorded in 
order to f aciliJtate scoring. Judgement is made on how veil the student 
comprehends information and is able to respond appropriately. Comprehen- 
sion is determined by a student's ability to respond accurately Cbut not . 
necessarily in extended discourse)^ A parallel procedure is followed for 
evaluating 'a student's overall production. The overall performance level 
■ ' is ,base4 on the actual number of correct responses defined by a predeter- 
mined critefi*. Scored on a pass/fail basis, it is recommended that stu- 
dents who- pass the Oral Interview continue to the Oral Competency Test. 

Oral Compete'ncy CO-C.> Test . The .Oral Competency Test was based on 
a qua si^reali Stic activity and was designed to assess a student's degree 
of oral competency through th« oral interpretation of a visual stimulus. 
Questions concerning the visual stimulus were developed by the authors 
in order to assess overall coiranunicalive competence. The test is admi- 
nistered individually and ,a tape recorder is recommended to facilitate 
evaluation of the student's discourse. The -performance level is based 
on rating? of the student's overall control of phonology, vocabulary, 
, grammar, -and fluency as. defined on a predetermined criteria. Upon com- 
pletion of the test, all students, regardless of the score, continue 
with the Aural Comprehension Test. T.his is recommended because often in 
the caSeSf ^second language learners aural coaiprehension far exceeds pro- 
ductive abilities (Chastain, 1976)", v' 
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Aural Comprehension (A-C.) Test . The Aural Comprehension Test, was 
specifically designed to assess listening comprehension. ' It was modeled 
on the Pillsbury, Thrasher, Upshur (1963) Aural Comprhension Test . The 
sent^fte structures in this test were selected as a gul-de because they 
were repc^entative of those necessary to function In English. The ESLAB 
Aural Comprehension Test consists of 25 statements. Each statement is 
accompanied by four picture frams from which the student selects the 
Appropriate picture.- It is administered as a group test and scored accord- 
■ ing to the number of" correct' responses . The overall performance level is 
^ based on the actual number of correct responses as defined by the pre~ 

determined criteria. Regardless of the. achievtd level, it is recommended 
' that students progress to the Dictation Exercise so that basic liter- 
acy skills level can be assessed. 

• Dictation Exercise XD.E.) . Dictation involves the examination of a - 

9^ ^ m 



student's English listening and writing skills or. mote specifically, a 
stiidenr»s ability to reproduce sounds, structures, vocabulary, and sto- 
ries. It consists of 12 words and a short paragraph. The words used 
were taken from the Dale-Chall 0948} word' list. Administered as a group 

, test, it requires that students write the' list of words and a- short 
paragraph. The performance level is determined by rating the student *s 
control of word usage,- spelling, and grammar as defined^on a predeter- 
mined criteria. Oiler's (1970). study indicated that the student's per- 
formance was an indication of the ability to recognize phonological and 
orthographical structure of the English language. Thus, if the student 

^ does not perform well on the dictation exercise, two possibilities sfiould 
be explored: either the student is literate in his/her native language 
but needs aural/oral drill before being Tahle to read and write in English, 
or the student is assumed to be illiterate in any language while he /'she my 
or may not ,posses<^ high degree of aural/oral skills in English. Stu- 
dents functioning at a Beginner II level on both the Aural Comprehension 
Ti-st and the Dictation Exercise are considered to be sufficiently liter-" 
are to continue vrith the Structural Competency Test. 

Structural Competency jS.C.) Test . The Structural Competency Test 
was developed to measure knowledge of English s>'ntactical structures re- 
quiring students to visually , identify the syntactical patterns of English.. 

. Tbis test is modeled after the Michigan Aural Examination (pillsbury et. 

\al . , 1963] and the English Sentence Structure curriculum developed by 
krohn et. al. (1971).' These references used by bilingual *and ESL teach- 

''"^ers as a guide for teaching English structures provided the guidelines 
for the structures selected. In its administration, the student is given 
a structure and four multiple choice responses from which to select. The 
student's performance is evaluated according to the actual number of cor- 
rect responses as defined by a predetermined criteria. It is recommended 

/that students functioning at an Intermediate I level or above continue to 
the Informal Reading Inventory CiRIj . ' 

Informal Reattlfcg Inventory (JKl} . The ESLAE Informal Reading 'Inven-\ 
tory ilKl} consists of a set of eight original passages which, range in 
length from 30 to 200 words-. The Fry (19681 and Dale-Chall Cl948} read- 
ability formulae were used to level the passages from primer to eighth 
grade; Ten multiple choice questions for each passage were constructed 
in order to evaluate the student's comprehension level. They were based 
on Barrett's Taxonomy CSmith and Barrett, 1974). The IRI is' administered 
as a group test. .Students read the graded stories silently and then 
answer the questions at the end of each selection. They are advised to 
stop when they no longer comprehend. 

Tbe overall reading level is then established using the following 
^ criteria: Independent: 95-100%; Instructional: 75-94%; Frustration: 
Q-74%. Students with an instructional level of at least second grade^ 
^ Intermediate I. level, are asked to complete the Writing Saniple. This 
procedure is recomaended because students who are not capable of reacting 
should not be forced to independently construct ^ written passage* 

* ■ ■ 

Writing Sample C^-S.} . Constructed as an unfinished story. The EStAB 
Writing Sample starter was taken from- thfe book The Spider Plant CSpeevack, 
1968),. The test is administered as a group test and the performance 



Jlevel isi determined through an evaluation of the student's control of 
word usage, spelling, grammar, and organiEation. 

Although ^he skills examined by the ESLAB tend to overlap, the tests 
have J^een arranged according to the difficulty invalvid in producing 
language » Students are asked to listen and speak, and then read and 
write. The ESLAB Tables of Specifications CTables 2 and ^.3) outline the 
skills assessed in both the receptive and expressive areas- ' 
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^lida^ion of th? ESIAB < . ' 

^^^^^^^^ validate the ESLAB tests, pilot i:esting was conducted 
ir^aJn inner city area comprised . of a large Hispanic, population. ' Ini- 
tially, the examiners c^ojTsi sting of, bilingual and ESL teachers'' were * 



trained In the administrap;^.on and scoring of, the battery, through three 
teacher ^training vdrkshops. Then a sample hi 59 Hisp^ic bilingual stu- 
dents ages 12 to 1|S* —se^^enth and eighth graders-- wai tested. When all 
the data had been collected, ^the results for each of the seven tests were 
analyzed. Item analysis was performed and val^ity and reliability were 
established, . ^ * ' . 



Item Ahalysis > Testing experts CThorndike and Hagen, 1969; Popham, 
1971; Litchman, 1973; Sax, 1974} varied in opinion as to the appropriate- 
procedure for item analysis of criterion-referenced measures. For this 
reason, item analysis for the Receptive and Expressive areas of the ESLAE 
was based upon the test developerii judgement of appropriate procedures 
and empirical data. For the receptive area, the difficulty index >Ci* e. , 
the percentage (p) of students \orrectly responding to an item) and the 
discrimination indext. Ci- e. ^eo^^elation between item and overall test 
results' or the pbifit biserial correlation CRPB}) were obtained. The p 
and the RPB values are described in Table 4. ' • * ^ 



IteiT^ Analysis Surnfl|jjaj 


n ESLAB 


Receptive 


Subtests 










Aural Comprehension Test 
Structural Competency Test 
Informal Reading Inventory 


\ 

- 0 

21.1-89'. 
1,7-64. 
0.0-69. 

/ — ' — 


9% 
4% 
5% 


.08-, 67 
-.01-. 47 
.00-. 70 



n ^ 59 



j^Percehtage or difficulty index 
Point biserial correlation 



The resul ts Indicated that items on the Aural Com^^rehension Test 
were easy in contrast to^ the results of the Structural" Competency and 
Informal Reading Inventory. Overall student p^^rformance on the Aural 
Comprehension Test was high. Indicating that students had mastered the 
content. The significantly lower performance on the other two tests 
indicated that students ' aural' skil^ were much stronger than was their 
abilitv to read and write. 



For the e:5^pressive sub 
the analysis for each item 
formed on th(^ basis of the 



>sts which are evaluated on a bating scale > 
iras" ylewed according to how each student per- 
;et ^criteria* 




■ For the Expressive Area, itejh quality was indicated l!>y a high cor- 
relation between subtests and the total score of each test (Nelson, 1974) 
Students were rated on^ a scale from poor to excellent. The results in- 
dicated whether the high achievers *on each^tem were the high achievers , 
on the specific test. The only test exeuijgped from this' process was the 
Oral Screening Test, which was rateS on a pass/fail basis. The item 

analysis results are summarized on Table 5. 

\ 

Table 5 

Iteii! Analysis Summary - ESLAB Expressive Subtests 



/ 



Ratings 



\ 




Poor> 


Fair { 


Good 


Excellent"^ 






Oral Competency Test 


- 7.70 


17.48 


53.30 


23.35 






^ictation Exercise 


14.70 


25,80 


35.03 


6.62 


> 




Writing Sample 


. 6.45 


48.72 


. 16.10 


6. 35 





n = 59 



ERIC 



Item discrimination for each of the four 'tests was significant at 
•p<.01. Overall, students 'performed better on aural/oral tests than the 
"literacy tests. 

Validity . Validity is the extent to .which a test measures what it 
purports to meastire (Heat on, 1975). For the ESLAB, validity was bas-ed 
on curricular and empirical analyses (Green, 1975) , Curricular consider- 
^flons included face and content validity, analyses. These were deter- 
mined by having examiners and examinees eValuate the tests and by having 
language and reading experts examine each of the items. In most cases, 
items were rated favorably; .where they were, not; suggestions were made 
for altering the items. Empirical analysis was also of two types. The 
firstvwas predictive validity for which Kendall's tau (Nie, et . al . , 
1975) was used to correlate each^test coropopent ' s level results (Tables 
6, 7, and 8) with four Teacher Estimates (T.E.) and ESL report card grades. 
Results indicated positive correlations at p<-01 in most cases. The 

second type was concurrent validity (Table 9), used only for the IRI. A 
Pearson Coefficient Indicated a high correlation or close relationship 
between the IRl and the other three reading tests indicating that they 
may'have produced similar ^results. The Teacher "Estimate had a negative 
correlation with the IRI possibly because teachers tended to underesti- 
mate students* reading levels. 

. . \ 

\ 
\ 



TabU 



ESLAB Tests . . 

Predictive Validity Correlations Between Teacher Estimates, Level 
Performance in the Express ive\ Language Area Tests, and. ESL Grades 



Teacher 


Estimates 




^ _3 

Level Performance 


ESL^ 


Teacher/C opponent 




O.C, D.E. W.S. 


Grades 


1 


O.C. 

W.S. 




.30893** 

.38769*. 
.24947* 


.41657** 


2 


- O.C. . 
D.E. 
W.S: 


• 


.34174** 

.44880** .48658** .32423** 
.37337** 


.37693** 
..36109** 


3 


O.C. 
D.E. 
W.S. 




.25211** 
.23859** 


.32598** 


^ 4 


O.C. 
D,E. 
W.S. 




.75003** , .45560** 

.51110** 










Tab\e 7 




Predictive Validity 
Performance in the 


ESLAB Tet^ts 

' Correlations Between Teacher Estimates, Level 

Receptive Language Area Tests, and ESL IGrades 

if 


Teacher 


Estimates 




Level Performance 


ESL° 


Teacher/Component 




.A.C. S.C. IRI 


Grad-es 


1 


A.C. 
S.C. 

IRI 




— 1 

. 258S>9* 

. .33405** .25874* 


.41850** 
.35967** 


2 


A.C. 
S.C. 
IRI 




.35714* .28360* 
.48658** .32880** ' 
.57050** .34568** 


.37693** 
.24301* 


f" 


A.C. 
S.C. 
IRI 




.32931** 
.34397** 


29860** 


4 


A.C'. 
S.C. 
IRI 




.65049** .50581** 





/ 



n = 59 

*pc6s 
**p^:.oi 



^English as a Second Language 



V 



-!5 



/ 



/ 



\ 



/ ' Table 8 

PreciictiYe Validity- Correlations: Receptive vand Expressive^ 
/ Subtests Results with ESL Grades * ' 



\ 
\ 



Recep/t ive 



ESL Grades 



Expressive \ 



ESL^ ' Grades 



A.c, 



s.c. 



\ 



IRI 



37662** 
25200** 
57600**' ^ 



O.C. 
D.E. 
W.S., ■ 



.25923** 
.29942** 
.2334"** 



;/n = 59 
^*p<.01 

i 



^English as a Second Language 



Table 9 

Concurrent Validity for the ESLAB Informal Reading . Inventory 

™ — — i 

^ * Cloze Stanford Diagnostic Teacher 

\ Test ^ Reading Test Estimate 



Informal Reading 
Inventor\^ 



7484 



6203 



.045^6 



Close Tes't 

Stanford Diagnostic 
Reading Test 



.6105* 



.3577 



4248^ 



\ n = 59 
*pCQS 
**^<.01 



Reliability , Reliability is defined as the accuracy with wftich a 
test measinres whatever it is intended to measure CThorndilce and Hagen, 
1969}. For the ESLAB,^ two types of reliability indices, internal con- 
sistency ^nd interrater reliability were obtained. The index of inter- 
nal consistency **is an index of the consistency of the subtests, or the 
degree to which the subtests measure the same thing" OJelson, 1974, p. 
280}. In other words, internal consistency indexes whethei" a total test 
score measuriRs conmon skills rather than individual distinct ones> In 
this study this means that all Items should be mea^ring linguistic 
skills and not, for example,' intellectual functioit'ing, or attitudes* The 
index of reliability was determined for both the |leceptive and Expressive 
Language Areas (Table 10) through computation of fljpyt*s anova and Cron- 
baches alpha. 



J 



Table 10 



/ / 

-^Ul — 



ReliabiHt/ for the Receptive and 
ESLAB Expressive Subtests 



Test 



I 



Receptive 



Wuxal Coiifprehensiori' /' 

/ / 

Structi;ral Compe^Wcy 

/ / / / 

Informal Reading/tnventory 

// ^ ' 
Total Test Stil>istics 

// 



./ / 



y Exp 



ressivi^/ 

/'/• 



Oral ^breening 
Oraf/ Competency 

i 

Di<y'tation 
Writing Sainple 
/rotal Test Statistics 



CoefficieTit Range 



Items 


Mcfui 




Low 




Hoy t » s 
anova 




CroT^bach 
alpha 


27 


19.46 


4.67 


0 


26 


> .81 ' 


1.97 




52 


9. 92. 


3 . 04 


0 


15 


. .37 


2.39 




80 


1 1 . 14 


6.61 • 


0 


22 


.83 


2.74 ' 


.74 


159 
















39.78 


9.31 
« 


18 


58 


\79 


4.25 


.21 



20 


53.80 


10.11 


0 


67 


.92 


2.75 


.4 


1 1 . 76 


2.39 


0* 


16 


.78 


.96 




7. 73 


2.92 


0 


12 


.90 


.74 


4 


7 . 08 


4.22 




IS 


.94 


.86 


31 


80.37 


13.89 


19 * 


99 


.91 


4.17 



,67 



i 



ERIC 



17 



n =,59 



Standard De-\^iation 

Standard Rrror of Measurement 



18 



Interrater reliability was established only for the expressive sub- 
tests.. For 't^i9 purposje, ch^ Reliability of the ratings of three raters 
were cor^elai^d on $'samplf^of 30 students. This procedure Xs'as ..followed 
in order to StMre their""reliability in assessing students. App'roximately 
half of the students tested pere randomly selected to be SAibjects for 
obtaining interrater reliability (Table 11). 

f ' • 

. , ' Table 11' "'-^^■'H^ 

ESLAB Interrater Reliability Sufrimary 



: i-ib^ - ■ ■ — ■ — - — — - — — . — — - — — — — ■ ' ■ ■ 

standard Standard Error Hoyt's^ 
Mean Deviation of Measurement Anova. ^ 



'Oral Screening 164.40 13.50 ^ 3.96 .91 

Oral Competency 75.63 5.85 |r' 2.85 .76 

Dictation " 25.03 ■ 7.25 ~' 1.97 .92 

Writing 26.97 3.50 2.34 .51 



n = 30 



On the whole, the interrater reliability coefficients were quite, sig- 
nificant demonstrating a high degree of agreement among r-aters. 

Overall, the ^even subtests were reliable' with the exception of the 
Structural ConpeteiWxN Test . On the basis of positive, content and predic- 
tive validity resulTS, it was retained as part of the entire battery with 
a strong recommendation for'^-faeld testing'. 



Standard E rror of Measurement (SEm) . The SEm values for the Receptive 
^ Te^ts were Aural Comprehension 1.97, Structural Competency 2,39, and 
IRI 2.74 with an overall value of 4.25. ^ 

The Expressive Area results were: Oral Screening Test 2.75, Oral 
Competency Test .96, Dictation Exercise ,74, and Writing Sample .86, with 
a total score of 4.17. In both instances, values were fairly low indi- - 
eating the tests to be a fair assessme^nt of student ability. 



Significance of the Study 

The importar^ of the study is that: Ca)i it meets the critical need 
, for an integrated criterion-referenced laiigua|e assessment battery; Cb) 
it provides a basis for evaluating language areas that are significant 
for learning to read in Englishr Cc) it identifies indepen4«itt , instruc- 
tional, and frustr'ation levels for the purpose of gr6upir(g. students in 
reading. Additionally, the use of the ESLAB provides curriculum devel- 
opers with an understanding of achieved and non-achieved student skills, 
and school systems with potential evaluative information of their bilin- 
gual program. 
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